Approximating Reversal Distance for Strings with Bounded Number of Duplicates

نویسنده

  • Petr Kolman
چکیده

For a string A = a1 . . . an, a reversal ρ(i, j), 1 ≤ i < j ≤ n, transforms the string A into a string A′ = a1 . . . ai−1ajaj−1 . . . aiaj+1 . . . an, that is, the reversal ρ(i, j) reverses the order of symbols in the substring ai . . . aj of A. In a case of signed strings, where each symbol is given a sign + or −, the reversal operation also flips the sign of each symbol in the reversed substring. Given two strings, A and B, signed or unsigned, sorting by reversals (SBR) is the problem of finding the minimum number of reversals that transform the string A into the string B. Traditionally, the problem was studied for permutations, that is, for strings in which every symbol appears exactly once. We consider a generalization of the problem, k-SBR, and allow each symbol to appear at most k times in each string, for some k ≥ 1. The main result of the paper is a simple O(k)-approximation algorithm running in time O(k · n). For instances with 3 < k ≤ O( √ log n log∗ n), this is the best known approximation algorithm for k-SBR and, moreover, it is faster than the previous best approximation algorithm. In particular, for k = O(1) which is of interest for DNA comparisons, we have a linear time O(1)-approximation algorithm.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Reversal Distances for Strings with Few Blocks or Small Alphabets

We study the String Reversal Distance problem, an extension of the well-known Sorting by Reversals problem. String Reversal Distance takes two strings S and T as input, and asks for a minimum number of reversals to obtain T from S. We consider four variants: String Reversal Distance, String Prefix Reversal Distance (in which any reversal must include the first letter of the string), and the sig...

متن کامل

Reversal Distance for Strings with Duplicates: Linear Time Approximation Using Hitting Set

In the last decade there has been an ongoing interest in string comparison problems; to a large extend the interest was stimulated by genome rearrangement problems in computational biology but related problems appear in many other areas of computer science. Particular attention has been given to the problem of sorting by reversals (SBR): given two strings, A and B, find the minimum number of re...

متن کامل

. C O ] 4 J ul 2 00 6 Prefix reversals on binary and ternary strings ⋆

Given a permutation π, the application of prefix reversal f (i) to π reverses the order of the first i elements of π. The problem of Sorting By Prefix Reversals (also known as pancake flipping), made famous by Gates and Papadimitriou (Bounds for sorting by prefix reversal, Discrete Mathematics 27, pp. 47-57), asks for the minimum number of prefix reversals required to sort the elements of a giv...

متن کامل

Computing the Reversal Distance between genomes in the Presence of Multi-gene Families via Binary Integer Programming

Hannenhalli and Pevzner developed the first polynomial-time algorithm for the combinatorial problem of sorting signed genomic data. Their algorithm determines the minimum number of reversals required for rearranging a genome to another -but only in the absence of gene duplicates. However, duplicates often account for 40% of a genome. In this paper, we show how to extend Hannenhalli and Pevzner'...

متن کامل

Prefix Reversals on Binary and Ternary Strings

Given a permutation π, the application of prefix reversal f (i) to π reverses the order of the first i elements of π. The problem of sorting by prefix reversals (also known as pancake flipping), made famous by Gates and Papadimitriou (Discrete Math., 27 (1979), pp. 47–57), asks for the minimum number of prefix reversals required to sort the elements of a given permutation. In this paper we stud...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Discrete Applied Mathematics

دوره 155  شماره 

صفحات  -

تاریخ انتشار 2005